PLANT MYB PROTEINS 

RELATED APPLICATION 

This application claims priority to U.S. Provisional Application Serial No. 60/399,999, 
filed on July 31, 2002, the contents of which are incorporated by reference in their entirety. 

BACKGROUND 

MYB proteins are a group of transcription factors. The cellular myb proto-oncogene 
(c-myb) is involved in the proliferation and/or differentiation of haematopoietic cells (Graf 
(1992) Curr. Op. Gen. Devel. 2, 249-255). Common to all MYB gene products is a strongly 
conserved DNA-binding domain located at the N-terminal end. In animals, this DNA-binding 
domain consists of three imperfect repeats of approximately 51-52 amino acids each (designated 
R1, R2, and R3) with three highly conserved tryptophan residues regularly spaced by 18 or 19 
amino acids (Weston (1999) Curr. Op. Gen. Devel. 8, 76-81). In plants and yeast, the 
predominant MYB proteins have two repeats (R2 and R3) (Martin and Paz-Ares (1997) Trends 
Genet. 13, 67-73; Jin and Martin (1999) Plant Mol. Biol. 41, 577-585). However, MYB proteins 
containing only one repeat or three repeats have also been identified in plants (Baranowskij, et 
al. (1994) EMBO J 13, 5383-5392; Kirik and Bäumlein (1996) Gene 183, 109-113; Feldbrügge, 
et al. (1997) Plant J. 11, 1079-1093; Wang, et al. (1997) Plant Cell 9, 491-507; Braun and 
Grotewold (1999) Plant Physiol. 121, 21-24). 

SUMMARY 

The invention is based on the discovery of three novel rice genes encoding MYB proteins, 
designated OsMYBS1, OsMYBS2, and OsMYBS3, respectively. These proteins bind to 
promoters containing one or more copies of a TATCCA sequence and regulate the promoter 
activity. 

The full-length OsMYBS1, OsMYBS2, and OsMYBS3 cDNAs (designated SEQ ID 
NOs:1-3, respectively), with the start and stop codons underlined, are shown below: 
OsMYBS1 cDNA: 

GTGCGAGATCCACCACCCGATGACCTCCCAGGCGGCGACGACGACGACCACGGCGGCGGCGGCGGCGGCGTGGACCA 

GGGAGGACGACAAGGCGTTCGAGAACGCGCTCGCGGCTTGCGCGGCGCCGCCGCCCGCGGACGGAGGCGCGCCCGAC 

GACGACTGGTTCGCCGCGCTCGCCGCGAGCGTGCCCGGGGCGAGGTCGGCGGAGGAGGTGCGGAGGCACTACGAGGC 
GCTGGTGGAGGACGTCGCGGCCATCGACGCGGGCCGCGTCCCGCTCCCGCGCTACGCCGGGGAGGAGTCCGCGGCGC 
CGCCCGACGGAGCCGGAGCCGCCGCCGCCGCGTCCAAGGACGGCGGACACCGGCGCGACGAGCGCAAGGGCGGCGGC 
GGCGGGTACGACGGCGGCAAGAGCTGCTCCAAGGCGGAGCAGGAGAGGCGCAAGGGCATCCCATGGACGGAGGAAGA 
GCACAGGCTGTTCTTGCTGGGGCTGGACAAGTTCGGCAAGGGGGACTGGCGGAGCATCTCGCGCAACTTCGTCATCT 
CGCGGACGCCAACGCAGGTGGCGAGCCACGCGCAGAAGTACTTCATCCGCCTCAACTCCATGAACCGCGACCGCCGC 
CGCTCCAGCATCCACGACATCACCAGCGTCACCGCCGGCGATCAGGTCGCCGCGCAGCAGGGCGCCCCGATCACCGG 
CCACCAGGCCACGGGCAACCCCGCGGCGGCGGCGCTGGGCCCGCCGGGCATGAAGCACCACCACCACCACCACCCGG 
GCGGCGCGCCGCCGCCCATGCCCATGTACAGCGCCGCGCCCATGGGCCACCCCGTCGCCGGCCACATGGTGCCCGCC 
GCCGTCGGCACGCCGGTGGTGTTCCCGCCGGGCCACGCGCCGTACGTCGTGCCCGTCGGCTACCCGGCGCCTCCGGC 
CAAGATGCACCAATGACGCGCCATGGACGGACATGAGCAGCATTTCTTCCTCCTCCTTTCTTGATGTCAATCTTGAT 
TTGTTCTTTGTGTAGTCGCCGGCTCATCGTCCCTGATCATCTTGTTCTTCTCACAATCTCACTAATGTAAACATACA 

TAGATCAGATGCCAAGAGTGCAGGGATTGGGGATTAAAGGCGAATAAGTAAAGTATTTTGCTGACTGTTTGCAAGTG 
ATCATCACGTACACCCGGTGAAAGCTTAGCTCCAAATGTGGATGTAATTAGCAGCGGCCTTCCGTACGTGGTGGCGC 
CGATCGATGATCTTGCAGGGGTTGCAATTAGGGATTGATTTCCATTTTGCTGATGTAAATTTGCCAACTGTCTCATT 

GGACCAAAAAAAAAAAAAAAA (SEQ ID NO: 1) 

OsMYBS2 cDNA: 

CGAGGTCCGCGGCGGCGGCGGCGGAGTTGACGAGGAGGAGTACGAGGAGGAGGAGGTGGAGGGTGGATTGTTCATCA 
AGAAGAGCTCCAGTATGCCCAACCTCACCTCCATCGACCCGCTGCCGGTGCCGGCCGACGGCGGCAAACGGCGCGCC 
TCCGACGACTCCGAGCTCGCCTCCGGCCAGCAGAAGCGCCGCCGCCGCAAGGTGCAGGAGAGGAAGAAAGGGGTACC 
ATGGACTGAGGAGGAGCACAAGAAATTCCTGGAAGGGCTGAGGCAGCTGGGGAAAGGGGACTGGAGAGGCATCTCCA 
AGAACTTTGTGACCAGCAGGACGGCGACTCAGGTGGCCAGCCACGCCCAGAAGTACTTCCTCCGGCAGACCAACCCT 
GGCAAAAAGAAGCGCCGGGCCAGCCTCTTTGATGTTGTTGCTGAGTGCAGTGATGATCAACTTCCAAGTCCTCAGAG 
TGTTGGAACTAAGCCTCCTACCCAGGATATAATTCATACAGATCGCGGCGATGTCCCGATACTAAGCTATCCAGTTG 
CTAGAGGCTTTAGAGGCGATAGCGTGCAGGTTGATGAACTAACTGAATATGTGAAGAGATTAAAGGCCGCCGAGGAC 
ATGTCGCTCTCCATGATCTCTGGACTGGAAATGGCATCATCATCCATCAGCAGTCTAGAGCTCAGTATCGCGCCCTC 
TCATTTGCGGATCGACGGGGCCATCAAGGGGCTGGGATCCAAACCCAATTTTCCCCCGAAGGAATTTGGATCGGCTT 
CAGCTACTGTTTTTTGTCCCCCCTGTTGTTGTTTGTTGTTGTTGTTTTTTTTTTTTTTTTTTGCGGGGGTTGTTTGT 
TGTTGTTGTTGTTGTAGTTGTCATGCTAACTTTGTATTTGGGTCATGTGGGGTTTCTTTCACCAGTTTTATATAATA 
C AG AG AG AAT G T C AG TCCCTTCC GAG AC AT G T T T AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAA (SEQ ID NO: 2) 

OsMYBS3 cDNA: 

ATCGATCGATCGATCTCCATAGGTGGGGGAAGGGAAGCTTTGGAAGGTGGAGGGACGGAGGGGGGGATGACGAGGCG 
GTGCTCGCACTGCAGCCACAACGGGCACAACTCGCGGACGTGCCCCAACCGCGGGGTCAAGATCTTCGGGGTGCGCC 
TCACCGATGGCTCCATCCGCAAGAGCGCCAGCATGGGGAACCTCTCCCTCCTCTCCTCCGCCGCCGGATCCACCAGC 
GGCGGCGCCTCCCCCGCCGACGGCCCCGACGCCGCCCCCACCGCCGCCGACGGCTACGCCTCCGACGACTTCGTCCA 
GGGCTTCTCCTCCGCCACCCGCGACCGCAAGAAGGGTGTTCCTTGGACTGAAGAAGAACACCGGAGGTTTTTGCTTG 
GATTGCAAAAGCTTGGCAAAGGTGATTGGCGAGGAATCTCTCGTAATTTCGTGGTCTCAAGAACACCTACTCAAGTA 
GCCAGTCATGCTCAGAAATATTTTATACGCCAATCCAATATGACCAGAAGGAAAAGAAGGTCTAGCCTTTTTGACAT 
GGTGCCAGATGAGTCTATGGACCTTCCACCACTTCCTGGAGGTCAAGAACCAGAGACCCAAGTATTAAATCAACCAG 
CACTACCTCCACCGAAGGAGGAAGAGGAGGTAGATTCTATGGAGTCAGATACTTCTGCCGTTGCAGAGAGCTCTTCC 
GCTTCTGCTATCATGCCAGATAATTTGCAGTCGACCTATCCAGTGATTGTTCCAGCTTATTTCTCGCCCTTTTTGCA 
ATTCTCGGTTCCTTTCTGGCAAAATCAGAAAGATGAAGATGGTCCTGTGCAAGAAACACATGAGATTGTCAAGCCTG 
TTCCAGTTCATTCAAAGAGCCCAATCAACGTTGATGAGCTTGTTGGCATGTCGAAGCTCAGCATAGGAGAGTCCAAT 

CAAGAGACAGAGTCTACTTCTCTTTCATTAAATCTGGTAGGAGGTCAAAATAGACAATCAGCTTTCCATGCAAATCC 
ACCAACAAGGGCACAGGCATGATCTGGTTGTGCACACAACTGCATTTAGATGAATCCCAGGCAAAATAAGCTTTGCC 
TCCTTGTTTTTTTGTTTTTATTTTAAGATT71ACCGTTCTCCGTAGTCTGTATCATGTGCTGTAAGTTATGCTATGTA 
TGAATGTATCTGTTGTTTGTCTGGCACACATGATAAATCACTCTATGTTAACAAAATCAGTAATGGTAGTGCTGATC 
TTCGTGGTTGTACTGTTGTAAACTCTTTTATAAGAAAAAAAAATATTAGTTAGTC (SEQ ID NO: 3) 

The nucleic acid sequences encoding the OsMYBS1, OsMYBS2, and OsMYBS3 proteins (i.e., 
from the ATG start codon to the codon immediately before the stop codon in SEQ ID NOs:1-3) 
are designated SEQ ID NOs:4-6, respectively. The OsMYBS1, OsMYBS2, and OsMYBS3 
proteins (designated SEQ ID NOs:7-9) encoded by the above cDNAs are shown below: 
OsMYBS1 protein: 

MTSQAATTTTTAAAAAAWTREDDKAFENALAACAAPPPADGGAPDDDWFAALAASVPGARSAEEVRRHYEALVEDVA 
AIDAGRVPLPRYAGEESAAPPDGAGAAAAASKDGGHRRDERKGGGGGYDGGKSCSKAEQERRKGIPWTEEEHRLFLL 
GLDKFGKGDWRSISRNFVISRTPTQVASHAQKYFIRLNSMNRDRRRSSIHDITSVTAGDQVAAQQGAPITGHQATGN 
PAAAALGPPGMKHHHHHHPGGAPPPMPMYSAAPMGHPVAGHMVPAAVGTPVVFPPGHAPYVVPVGYPAPPAKMHQ 

(SEQ ID NO: 7) 

OsMYBS2 protein: 

MPNLTSIDPLPVPADGGKRRASDDSELASGQQKRRRRKVQERKKGVPWTEEEHKKFLEGLRQLGKGDWRGISKNFVT 
SRTATQVASHAQKYFLRQTNPGKKKRRASLFDVVAECSDDQLPSPQSVGTKPPTQDIIHTDRGDVPILSYPVARGFR 
GDSVQVDELTEYVKRLKAAEDMSLSMISGLEMASSSISSLELSIAPSHLRIDGAIKGLGSKPNFPPKEFGSASATVF 
CPPCCCLLLLFFFFFFAGVVCCCCCCSCHANFVFGSCGVSFTSFI (SEQ ID NO: 8) 

OsMYBS3 protein: 

MTRRCSHCSHNGHNSRTCPNRGVKIFGVRLTDGSIRKSASMGNLSLLSSAAGSTSGGASPADGPDAAPTAADGYASD 
DFVQGFSSATRDRKKGVPWTEEEHRRFLLGLQKLGKGDWRGISRNFVVSRTPTQVASHAQKYFIRQSNMTRRKRRSS 
LFDMVPDESMDLPPLPGGQEPETQVLNQPALPPPKEEEEVDSMESDTSAVAESSSASAIMPDNLQSTYPVIVPAYFS 
PFLQFSVPFWQNQKDEDGPVQETHEIVKPVPVHSKSPINVDELVGMSKLSIGESNQETESTSLSLNLVGGQNRQSAF 

HANPPTRAQA (SEQ ID NO: 9) 
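
As an illustration only, the relationship between a cDNA, its coding sequence (from the ATG start codon to the codon immediately before the stop codon), and the encoded protein can be sketched in Python as follows. The function names, the rule of taking the first ATG that has an in-frame stop codon as the start codon, and the short test fragment are assumptions made for this sketch; they are not part of the sequence listing.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; amino acids are listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def coding_sequence(cdna):
    """Return the region from the first ATG that has an in-frame stop codon
    up to (not including) that stop codon, or None if no such region exists."""
    seq = "".join(cdna.split()).upper()  # tolerate whitespace inside a listing
    start = seq.find("ATG")
    while start != -1:
        for i in range(start, len(seq) - 2, 3):
            if CODON_TABLE.get(seq[i:i + 3]) == "*":
                return seq[start:i]
        start = seq.find("ATG", start + 1)
    return None


def translate(cds):
    """Translate a coding sequence codon by codon with the standard code."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


# Hypothetical short fragment used only to exercise the functions.
fragment = "CCGATGACCTCCCAGGCGTGATT"
cds = coding_sequence(fragment)
print(cds, translate(cds))  # ATGACCTCCCAGGCG MTSQA
```

The start-codon rule used here is a simplification; a real annotation would rely on the underlined start and stop codons of the sequence listing.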

Accordingly, the invention features a pure polypeptide including an amino acid sequence 
at least 70% (e.g., at least 75, 80, 85, 90, or 95; or 100%) identical to SEQ ID NO:7, 8, or 9. 
When expressed in a cell, the polypeptide binds to a promoter and regulates transcription from 
the promoter. The cell can be a plant cell, in particular, a monocot plant cell (e.g., a cereal plant 
cell such as a rice cell or a barley cell). The promoter can contain one or more copies of a 
TATCCA sequence or its variant. Moreover, an HvMYBGa protein can be co-expressed in the 
cell. HvMYBGa interacts with the OsMYBS proteins, and together they cooperatively regulate 
the expression of a gene. The polypeptides of the invention can be used for producing OsMYBS 
antibodies (either monoclonal or polyclonal). These antibodies in turn are useful for detecting 
the presence and distribution of OsMYBS proteins in tissues and in cellular compartments. For 
example, such antibodies can be used to verify the expression of OsMYBS proteins in a 
transgenic plant. 

A "pure polypeptide" refers to a polypeptide substantially free from naturally associated 
molecules, i.e., it is at least 75% (e.g., at least 80, 85, 90, or 95; or 100%) pure by dry weight. 
Purity can be measured by any appropriate standard method, for example, by column 
chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. 

The "percent identity" of two amino acid sequences is determined using the algorithm of 
Karlin and Altschul ((1990) Proc. Natl. Acad. Sci. USA 87, 2264-2268), modified as in Karlin 
and Altschul ((1993) Proc. Natl. Acad. Sci. USA 90, 5873-5877). Such an algorithm is 
incorporated into the XBLAST programs of Altschul, et al. ((1990) J. Mol. Biol. 215, 403-410). 
BLAST protein searches are performed with the XBLAST program, score = 50, wordlength = 3. 
Where gaps exist between two sequences, Gapped BLAST is utilized as described in Altschul, et 
al. ((1997) Nucleic Acids Res. 25, 3389-3402). When utilizing BLAST and Gapped BLAST 
programs, the default parameters of the respective programs (e.g., XBLAST) are used. See 
www.ncbi.nlm.nih.gov. 
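
For orientation, the notion of a percent identity can be illustrated with a simplified count over two sequences that have already been aligned. This sketch is not the Karlin and Altschul statistics and is not BLAST; the function name, the gap character, and the choice of denominator (all alignment columns that are not gaps in both sequences) are assumptions made here for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same
    residue. Columns that are gaps in both sequences are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Hypothetical aligned fragments; one gap in the first sequence.
print(percent_identity("MT-QA", "MTSQA"))  # 80.0
```

A value computed this way depends on how the alignment was produced and on the denominator chosen, so it may differ from the identity reported by a BLAST program.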

The invention further features an isolated nucleic acid (e.g., a vector) encoding a 
polypeptide of the invention, and a cell (in a culture or in a transgenic plant) containing a nucleic 
acid of the invention. The cell can be a plant cell, in particular, a monocot plant cell (e.g., a 
cereal plant cell such as a rice cell or a barley cell). An example of a nucleic acid within the 
invention includes an isolated nucleic acid characterized in that it hybridizes under stringent 
conditions to SEQ ID NO:1, 2, or 3, or a complementary sequence thereof. Such a nucleic acid 
can be at least 15 (e.g., at least 30, 50, 100, 200, 500, or 1000) nucleotides in length. These 
nucleic acids and cells can be used for generating a transgenic plant or producing the 
polypeptides of the invention. For instance, the nucleic acids of the invention can be used to 
determine whether an OsMYBS mRNA is expressed in a tissue or cell. The nucleic acids can be 
used as primers in PCR-based detection methods, or as labeled probes in nucleic acid blots (e.g., 
Northern blots). 

An "isolated nucleic acid" is a nucleic acid the structure of which is not identical to that 
of any naturally occurring nucleic acid or to that of any fragment of a naturally occurring 
genomic nucleic acid. The term therefore covers, for example, (a) a DNA which has the 
sequence of part of a naturally occurring genomic DNA molecule but is not flanked by both of 
the coding sequences that flank that part of the molecule in the genome of the organism in which 
it naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic DNA of a 
prokaryote or eukaryote in a manner such that the resulting molecule is not identical to any 
naturally occurring vector or genomic DNA; (c) a separate molecule such as a cDNA, a genomic 
fragment, a fragment produced by polymerase chain reaction (PCR), or a restriction fragment; 
and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a 
fusion protein. 

By hybridization under "stringent conditions" is meant hybridization at 65°C, 0.5 X SSC, 
followed by washing at 45°C, 0.1 X SSC. 

A plant cell of the invention can be cultivated to generate a transgenic plant. The 
transgenic plant can be a monocot plant, e.g., a cereal plant such as rice or barley. The genome 
of such a transgenic plant contains a transgene that includes a nucleic acid characterized in that it 
hybridizes under stringent conditions to SEQ ID NO:1, 2, or 3, or a complementary sequence 
thereof. The transgene, when expressed in a cell, regulates the activity of promoters, e.g., those 
containing one or more copies of a TATCCA sequence or its variant. 

In addition, the invention features a method of expressing a transcript in a cell (e.g., a 
plant cell). The method involves introducing into a cell a vector that contains a nucleic acid 
encoding a transcript and expressing the transcript in the cell. The transcript is characterized in 
that it hybridizes under stringent conditions to SEQ ID NO:1, 2, or 3, or a complementary 
sequence thereof. The transcript can encode a polypeptide of the invention, which regulates the 
activity of promoters, e.g., those containing one or more copies of a TATCCA sequence or its 
variant. On the other hand, the transcript can be an anti-sense RNA, which inhibits the 
expression of endogenous OsMYBS genes, thereby deregulating the activity of promoters, e.g., 
those containing one or more copies of a TATCCA sequence or its variant. 

The invention also features a method of identifying a compound that modulates the 
activity of the polypeptide of this invention, i.e., the ability of the polypeptide to regulate 
expression of a gene. The method includes (1) introducing a test compound into a system 
containing a nucleic acid having a promoter containing one or more copies of an element that 
binds to the polypeptide, e.g., a TATCCA sequence, the promoter being operably linked to a 
reporter gene; and (2) determining an expression level of the reporter gene in the system. The 
expression level of the reporter gene in the presence of the test compound, if different from that 
in the absence of the test compound, indicates that the test compound is a candidate for 
modulating the activity of the polypeptide. The system, which can be a cell or a cell-free system 
(e.g., a cell lysate), may or may not contain the polypeptide of this invention. In the latter case, to 
practice the method, one can introduce into the system the polypeptide or a nucleic acid encoding 
the polypeptide. The test compound can be a small molecule, a polypeptide, or a nucleic acid 
that encodes a test polypeptide or binds to the polypeptide of this invention. The reporter gene 
can be a LacZ gene, a green fluorescent protein gene, a GUS gene, or a luciferase gene. 

This invention provides a system for highly regulated expression of endogenous or 
exogenous genes in a plant cell or a transgenic plant. The details of some embodiments of the 
invention are set forth in the accompanying description below. Other features, objects, and 
advantages of the invention will be apparent from the detailed description, and from the claims. 

DETAILED DESCRIPTION 

Expression of α-amylase genes in cereals is induced by both gibberellin (GA) and sugar 
starvation. All α-amylase genes isolated from cereals contain a TATCCA element or its variant 
at positions approximately 90 to 150 bp upstream of the transcription start site. The TATCCA 
element is an important component of the GA response complex (GARC) and the sugar response 
complex (SRC) of α-amylase gene promoters. 
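
A minimal sketch of how copies of the TATCCA element might be located in a promoter, on either strand and relative to the transcription start site, is given below. The function names, the coordinate convention (the last base of the supplied upstream sequence is position -1), and the test sequence are assumptions made for this illustration; variants of the element are not handled.

```python
def revcomp(seq):
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_element(upstream, element="TATCCA"):
    """Locate an element on both strands of an upstream sequence.

    `upstream` is written 5'->3' and ends immediately before the
    transcription start site, so its last base is position -1.
    Returns sorted (position, strand) tuples, where position is that of
    the leftmost base of the match on the given (top) sequence.
    """
    upstream = upstream.upper()
    hits = []
    for strand, motif in (("+", element), ("-", revcomp(element))):
        start = upstream.find(motif)
        while start != -1:
            hits.append((start - len(upstream), strand))
            start = upstream.find(motif, start + 1)
    return sorted(hits)


# Hypothetical 20-bp upstream sequence with one copy on each strand.
print(find_element("GGTATCCACCTGGATAAAAA"))  # [(-18, '+'), (-10, '-')]
```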

Three cDNA clones encoding novel MYB proteins with a single DNA-binding domain 
have been isolated from a rice suspension cell cDNA library and designated OsMYBS1, 
OsMYBS2, and OsMYBS3. Unexpectedly, OsMYBS proteins bind specifically to the TATCCA 
element in vitro. In vivo, OsMYBS1 and OsMYBS2 bind to the TATCCA element and 
transactivate a promoter containing the TATCCA element when sugar is provided, whereas 
OsMYBS3 represses transcription of the same promoter under sugar starvation. Furthermore, 
these three OsMYBS proteins cooperate with a GA-regulated transcription factor, GAMyb 
(HvMYBGa), in transactivation of a low-pI barley α-amylase gene promoter in the absence of 
GA. 

In one aspect, the present invention features pure OsMYBS polypeptides (e.g., SEQ ID 
NOs:7-9), including functional OsMYBS polypeptides. A "functional polypeptide" refers to a 
polypeptide which possesses biological activity equivalent to that of a wild-type OsMYBS 
protein, e.g., a fragment of a wild-type OsMYBS protein. 

In another aspect, the invention features isolated OsMYBS nucleic acids (i.e., DNA, 
cDNA, and RNA) characterized in that they hybridize under stringent conditions to SEQ ID 
NO:1, 2, or 3, or a complementary sequence thereof. The nucleic acids of the invention include 
sequences that are degenerate as a result of the genetic code. 

A nucleic acid of the invention can be expressed in vitro by DNA transfer into a suitable 
host cell by methods known in the art. For example, the nucleic acid can be inserted into a 
recombinant expression vector. A variety of host-expression vector systems can be utilized to 
express a nucleic acid of the invention. These include, but are not limited to, microorganisms 
such as bacteria transformed with recombinant bacteriophage DNA, plasmid DNA, or cosmid 
DNA expression vectors; yeast transformed with recombinant yeast expression vectors; plant cell 
systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV), or transformed with recombinant plasmid expression 
vectors (e.g., Ti plasmid). Isolation and purification of recombinant polypeptide, or fragments 
thereof, provided by the invention, can be carried out by conventional means including 
preparative chromatography and immunological separations involving monoclonal or polyclonal 
antibodies. 

The invention also features antibodies against an OsMYBS polypeptide, including 
monoclonal antibodies and polyclonal antibodies. The term "antibody" includes intact molecules 
as well as fragments thereof, such as Fab, F(ab')2, and Fv, which are capable of binding to an 
epitopic determinant present in an OsMYBS polypeptide. Methods of making monoclonal and 
polyclonal antibodies and fragments thereof are known in the art. See, for example, Harlow and 
Lane (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, New York. 

Also within the scope of the invention is a transgenic plant, the genome of which contains 
a transgene that includes a nucleic acid characterized in that it hybridizes under stringent 
conditions to SEQ ID NO:1, 2, or 3, or a complementary sequence thereof. The transgene, when 
expressed in a cell, regulates the activity of promoters, e.g., those containing one or more copies 
of a TATCCA sequence or its variant. 

As used herein, the term "plant" refers to either a whole plant, a plant part, a plant cell, or 
a group of plant cells, such as plant tissue. Plantlets are also included within the meaning of 
"plant". Plants included in the invention are any plants amenable to transformation techniques, 
including angiosperms, gymnosperms, monocotyledons, and dicotyledons. Examples of 
monocotyledonous plants include, but are not limited to, asparagus, field and sweet corn, barley, 
wheat, rice, sorghum, onion, pearl millet, rye, and oats. Examples of dicotyledonous plants 
include, but are not limited to, tomato, tobacco, cotton, rapeseed, field beans, soybeans, peppers, 
lettuce, peas, alfalfa, clover, cole crops or Brassica oleracea (e.g., cabbage, broccoli, cauliflower, 
Brussels sprouts), radish, carrot, beets, eggplant, spinach, cucumber, squash, melons, cantaloupe, 
sunflowers, and various ornamentals. Woody species include poplar, pine, sequoia, cedar, oak, 
etc. 

Transgenic plants can be produced using methods well known in the art. See, for 
example, Weissbach and Weissbach (1988) Methods for Plant Molecular Biology, Academic 
Press, NY, Section VIII, pp. 421-463; Grierson and Corey (1988) Plant Molecular Biology, 2d 
Ed., Blackie, London, Ch. 7-9; and Horsch, et al. (1985) Science 227, 1229. 

In addition, the invention features a method of expressing a transcript in a cell (e.g., a 
plant cell). The method involves introducing into a cell a vector that contains a nucleic acid 
encoding a transcript and expressing the transcript in the cell. The transcript is characterized in 
that it hybridizes under stringent conditions to SEQ ID NO:1, 2, or 3, or a complementary 
sequence thereof. The transcript can be an RNA that interferes with the function of another 
RNA in a plant cell, for example, preventing an mRNA from being translated into a polypeptide, 
or triggering specific RNA degradation to facilitate targeted post-transcriptional gene silencing 
(Mol et al. (1990) FEBS Lett 268(2), 427-30; and Fire et al. (1998) Nature 391, 806-811). 
Alternatively, the transcript can encode a polypeptide for over-production of the polypeptide in 
transformed plant cells or a transgenic plant. Alterations in OsMYBS expression result in 
changes in expression of genes regulated by OsMYBS proteins, e.g., those directed by promoters 
containing one or more copies of a TATCCA sequence or its variant. 

Also within the scope of this invention is a method of identifying a compound that 
modulates the ability of the above-described polypeptide (e.g., OsMYBS) to regulate expression 
of a gene. The compound can be used to enhance or repress the expression of a gene regulated 
by the polypeptide, e.g., an α-amylase gene. To identify the compound, one can introduce a test 
compound into a system containing a nucleic acid having a promoter containing one or more 
copies of an element that binds to the polypeptide, e.g., a TATCCA sequence. The promoter is 
operably linked to a reporter gene, such as a LacZ gene, a green fluorescent protein gene, a GUS 
gene, or a luciferase gene. One can then determine an expression level, e.g., the mRNA or 
protein level, of the reporter gene in the system by standard methods. The expression level of 
the reporter gene in the presence of the test compound, if statistically different from that in the 
absence of the test compound, indicates that the test compound is a candidate for modulating the 
activity of the polypeptide. The system can be a cell containing endogenous OsMYBS, e.g., a rice 
or barley cell, or a cell lacking OsMYBS, e.g., a yeast cell. In the latter case, to practice the 
method, one can express OsMYBS in the cell by introducing into it an expression vector encoding 
the polypeptide. The system can also be a cell-free system well known in the art, such as a cell 
lysate system. 
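
One possible way to judge whether reporter expression in the presence of a test compound is statistically different from that in its absence is sketched below. The description does not specify a statistical test; the exact two-sided permutation test on the difference in means, the function name, the replicate readings, and the 0.05 threshold are all assumptions chosen for this illustration.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(without_compound, with_compound):
    """Exact two-sided permutation test on the difference in group means.

    Every way of splitting the pooled readings into groups of the original
    sizes is enumerated; the p-value is the fraction of splits whose absolute
    difference in means is at least as large as the observed one.
    """
    pooled = list(without_compound) + list(with_compound)
    n = len(without_compound)
    observed = abs(mean(with_compound) - mean(without_compound))
    extreme = total = 0
    for idx in combinations(range(len(pooled)), n):
        chosen = set(idx)
        group_a = [pooled[i] for i in idx]
        group_b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point rounding.
        if abs(mean(group_b) - mean(group_a)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


# Hypothetical reporter readings (arbitrary units), four replicates each.
absent = [1.0, 1.1, 0.9, 1.0]
present = [2.0, 2.1, 1.9, 2.0]
p = permutation_p_value(absent, present)
is_candidate = p < 0.05  # assumed significance threshold
print(round(p, 4), is_candidate)
```

Exhaustive enumeration is practical only for small numbers of replicates; with larger data sets a sampled permutation test or a parametric test would be the usual substitute.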

The test compound can be a small molecule compound, a polypeptide, or a nucleic acid 
that encodes a test polypeptide or binds to the polypeptide of this invention. The test compound 
can be obtained from compound libraries, such as peptide or peptoid libraries. The libraries can 
be spatially addressable parallel solid phase or solution phase libraries. See, e.g., Zuckermann et 
al. J Med Chem 37, 2678-2685, 1994; and Lam Anticancer Drug Des 12:145, 1997. Methods for 
the synthesis of compound libraries are well known in the art, e.g., DeWitt et al. PNAS USA 
90:6909, 1993; Erb et al. PNAS USA 91:11422, 1994; Zuckermann et al. J Med Chem 37:2678, 
1994; Cho et al. Science 261:1303, 1993; Carell et al. Angew Chem Int Ed Engl 33:2059, 1994; 
Carell et al. Angew Chem Int Ed Engl 33:2061, 1994; and Gallop et al. J Med Chem 37:1233, 
1994. 

The specific examples below are to be construed as merely illustrative, and not limitative 
of the remainder of the disclosure in any way whatsoever. Without further elaboration, it is 
believed that one skilled in the art can, based on the description herein, utilize the present 
invention to its fullest extent. All publications recited herein are hereby incorporated by 
reference in their entirety. 

Materials and Methods 

1. Rice Cell Culture 

A suspension cell culture of rice (Oryza sativa cv. Tainan 5) was established as 
previously described (Yu, et al. (1991) J. Biol. Chem. 266, 21131-21137). Cells were 
subcultured every 7 days by transferring approximately 0.5 ml of cells into 25 ml of fresh liquid 
Murashige and Skoog medium (Murashige and Skoog (1962) Physiol Plant 15, 473-497) 
containing 3% sucrose in a 125-ml flask. Cells were cultured on a reciprocal shaker at 120 rpm 
and incubated at 26°C in the dark. 

2. Plasmids 

Plasmid αAmy8-C carries a 1.4-kb rice α-amylase cDNA insert in pBluescript KS+ 
(Stratagene) (Yu, et al. (1992) Gene 122, 247-253). Plasmid pcRAc1.3 contains a 1.4-kb rice 
actin gene (Act1) cDNA insert in pBluescript II-KS (McElroy, et al. (1990) Plant Mol. Biol. 15, 
257-268). Plasmid pRY18 carries a 3.8-kb DNA fragment that contains a rice genomic rDNA 
cluster, including the 3' half portion of the 18S rRNA gene, the complete 5.8S rRNA gene, and the 5' 
half portion of the 25S rRNA gene in pUC13 (Sano and Sano (1990) Genome 33, 209-218). 
Plasmid pGAMyb contains the HvMYBGa cDNA fused between the maize ubiquitin (Ubi) 
promoter and the nopaline synthase (Nos) terminator (Cercos, et al. (1999) Plant J. 19, 107-118). 
Plasmid pAHC18 contains the luciferase (Luc) cDNA fused between the Ubi promoter and the 
Nos terminator (Bruce, et al. (1989) Proc. Natl. Acad. Sci. USA 86, 9692-9696). Plasmid 
pAmy32b-GUS contains the 331-bp promoter region, the entire 5' untranslated sequence, and the 
first intron of Amy32b fused to the GUS coding sequence and the 3' untranslated region of 
Amy32b (Gomez-Cadenas, et al. (1999) Proc. Natl. Acad. Sci. USA 96, 1767-1772). A reporter 
construct, p3Luc.18, contains a sugar response sequence (SRS, nucleotides -186 to -82) of the rice 
α-amylase gene αAmy3. The SRS contains three motifs: a GC box, a G box, and the TATCCA 
element, essential for high-level sugar starvation-induced promoter activity in rice protoplasts 
(Lu et al. (1998) J. Biol. Chem. 273, 10120-10131). The SRS is fused upstream to the 
CaMV35S minimal promoter-Luc chimeric gene. Another reporter construct, T7, contains a 
modified αAmy8 SRS/GARS promoter. This promoter has a GC box, a G box, a gibberellin 
response element (GARE), and two copies of the TATCCA element, all of which are linked upstream 
to the CaMV35S minimal promoter-Luc chimeric gene. 

3. South-western Screening of cDNA Library 

Rice suspension cells were cultured in sucrose-containing medium for 5 days and 
transferred to sucrose-containing (+S) or sucrose-free (-S) medium for 4 h. Cells were collected 
and total RNA was purified. Poly(A)+ RNA was further purified using an oligo(dT) cellulose 
spin column (5 Prime → 3 Prime). The poly(A)+ RNA isolated from -S cells was used to 
construct a cDNA library in Lambda GEM-2 vector (Promega). E. coli strain Y1090 
(Stratagene) served as the bacterial host. For screening and plaque purification, the 416-bp DNA 
fragment containing eight tandem repeats of the -133 to -82 region of the αAmy3 promoter was 
excised with PstI and XhoI from p3Luc.44 (Lu, et al. (1998) J. Biol. Chem. 273, 10120-10131) 
and used as a probe for the cDNA library screening. The library was screened according to the 
method of Singh, et al. (1988) Cell 52, 415-423. Positive phage plaques were isolated and phage 
DNA was purified for further characterization. 

4. Plasmid Construction 

OsMYBS1 and OsMYBS2 cDNAs were excised from Lambda GEM-2 with SalI and 
NotI, and inserted into the same sites in pBluescript (Stratagene) to generate pBS-S1 and 
pBS-S2, respectively. Because there is one SalI site within the OsMYBS3 cDNA, the OsMYBS3 
cDNA was excised from Lambda GEM-2 with EcoRI and NotI and inserted into the same sites 
in pBluescript to generate pBS-S3. pAHC18 was digested with BamHI to remove the Luc 
cDNA insert and was then end-blunted. OsMYBS1 and OsMYBS2 were excised from pBS-S1 
and pBS-S2, respectively, with SalI and NotI and end-blunted. OsMYBS3 was excised from 
pBS-S3 with EcoRI and NotI and end-blunted. The OsMYBS cDNAs were then individually 
ligated with the truncated pAHC18 to generate a Ubi-OsMYBS-Nos fusion gene. An 
expression vector encoding Ubi-OsBZ8, pUBZ8, was also constructed. OsBZ8 (GenBank 
Accession No.: U42208) is a basic leucine zipper (bZIP) DNA binding protein. To make this 
vector, pAHC18 was digested with BamHI to remove the Luc cDNA insert and blunt-ended. 
The OsBZ8 cDNA was excised from a pBluescript plasmid with PstI and SpeI, blunt-ended, and 
ligated to the truncated pAHC18. 

5. Genomic DNA Gel Blot Analysis 

Genomic DNA was isolated from rice calli according to the method of Sheu, et al. (1996) 
J. Biol. Chem. 271, 26998-27004. Ten μg of genomic DNA was digested with restriction 
enzymes, fractionated in a 0.8% agarose gel, and transferred to a nylon membrane (MSI). 
Hybridization was performed at 42°C using 32P random primer-labeled OsMYBS gene-specific 
DNA as a probe. pBS-S1 was digested with SacII, and the 613-bp DNA fragment containing the 
220-bp coding region and the 393-bp 3' untranslated region of OsMYBS1 cDNA served as 
OsMYBS1-specific DNA. pBS-S2 was digested with SacI, and the 334-bp DNA fragment 
containing the 241-bp coding region and the 93-bp 3' untranslated region of OsMYBS2 cDNA 
served as OsMYBS2-specific DNA. pBS-S3 was digested with SacI, and the 599-bp DNA 
fragment containing the 332-bp coding region and the 267-bp 3' untranslated region of 
OsMYBS3 served as OsMYBS3-specific DNA. 

6. RNA Gel Blot Analysis 

Total RNA was purified from rice suspension cells using TRIZOL reagent (GIBCO-BRL). 
α-32P-labeled DNA probes were prepared, and RNA gel blot analysis was performed as 
described (Sheu, et al. (1996) J. Biol. Chem. 271, 26998-27004). In cases when the membrane 
blot was sequentially hybridized with various probes, the membrane was stripped and 
rehybridized as described (Sheu, et al. (1994) Plant J. 5, 655-664). Plasmid DNAs of αAmy8-C 
and pcRAc1.3 were digested with EcoRI. αAmy3 and αAmy8 gene-specific DNAs were 
prepared as described (Sheu, et al. (1996) J. Biol. Chem. 271, 26998-27004). These insert 
DNAs were individually isolated, labeled with α-32P, and used as probes. A DNA fragment 
containing 25S, 18S, and 5.8S rDNAs was excised from pRY18 using BamHI, labeled with 
α-32P, and used as a probe to equalize RNA loading. 

7. Expression and Purification of Recombinant Proteins 

The OsMYBS cDNAs were digested with SalI and NotI and ligated into the same sites in pET28b(+) 
(Novagen) to generate pET-S1, pET-S2, and pET-S3. The three plasmids were each transferred 
into E. coli strain BL21(DE3), and OsMYBS1, OsMYBS2, and OsMYBS3 were expressed. 
Purification of the OsMYBS proteins was performed according to the instructions provided by 
Novagen. The protein concentration was determined with Bradford reagent (Bio-Rad). 

8. Gel Mobility Shift Assay 

The gel mobility shift assay was performed essentially as previously described (Lu, et al. 
(1998) J. Biol. Chem. 273, 10120-10131), except that the DNA-protein binding reaction was 
carried out by incubation of 0.02 ng of 32P-labeled oligonucleotide with 0.2 μg recombinant 
OsMYBS purified from E. coli and 1 μg poly(dI-dC) in a total volume of 20 μl solution. A 
DNA fragment containing the TATA box was prepared as previously described (Lu, et al. (1998) 
J. Biol. Chem. 273, 10120-10131). 

9. Construction of the Yeast Reporter Strains 

The DNA fragment containing eight tandem repeats of -133 to -82 of the αAmy3
promoter was excised from p3Luc.44 (Lu, et al. (1998) J. Biol. Chem. 273, 10120-10131) with
EcoRI and XhoI, end-blunted and inserted into the SmaI site in pLacZi (Clontech) to generate an
8x(TATCCA+F)-Cyc1 minimal promoter-LacZ fusion gene. The two complementary
oligonucleotides 5'-AATTCTATCCATATCCATATCCATATCCATATCCATATCCAC-3'
(SEQ ID NO:10) and 5'-GTGGATATGGATATGGATATGGATATGGATATGGATAG-3'
(SEQ ID NO:11) were annealed and inserted into the EcoRI and SmaI sites of pLacZi to generate
a 6xTATCCA-Cyc1 minimal promoter-LacZ fusion gene. These plasmids were linearized with
NcoI and introduced into the genome of yeast strain YM4271, according to the
MATCHMAKER One-Hybrid System protocol (Clontech), to generate the yeast reporter strains.
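
The design of the two annealed oligonucleotides can be checked with a short script. The following is a minimal sketch, not part of the described protocol: it assumes the sequences of SEQ ID NOs:10 and 11 as printed above with extraction spacing removed, and the helper name `reverse_complement` is purely illustrative. It counts the TATCCA repeats and confirms that annealing leaves a 5' AATT overhang (compatible with an EcoRI-cut end) and a blunt end on the other side (compatible with SmaI).

```python
# Illustrative check of the annealed oligonucleotide pair (SEQ ID NOs:10 and 11).
# Sequences are copied from the text above; helper names are hypothetical.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


top = "AATTCTATCCATATCCATATCCATATCCATATCCATATCCAC"   # SEQ ID NO:10
bottom = "GTGGATATGGATATGGATATGGATATGGATATGGATAG"    # SEQ ID NO:11

# Number of (non-overlapping) copies of the 6-bp element in the top strand.
repeat_count = top.count("TATCCA")

# The top strand is longer; the extra 5' bases form the single-stranded overhang.
overhang = top[: len(top) - len(bottom)]

# The remainder of the top strand should pair exactly with the bottom strand.
duplex_ok = top[len(overhang):] == reverse_complement(bottom)

print(repeat_count, overhang, duplex_ok)
```

Under these assumptions the script reports six repeats, an AATT overhang, and a fully paired duplex, which matches the 6xTATCCA designation used in the text.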

10. Yeast One-Hybrid System for OsMYBS Binding Activity Assay 

For construction of the GAD-OsMYBS fusion gene, the full-length OsMYBS cDNAs
were PCR-amplified using pBS-S1, pBS-S2 and pBS-S3 as the DNA templates and
oligonucleotides OsMYBS1-5' (5'-AAACTCGAGAATGACCTCCCAGGCGGCGA-3' (SEQ
ID NO:12), XhoI site underlined) and OsMYBS1-3'
(5'-ATCGAATTCTCATTGGTGCATCTTGGCCGGA-3' (SEQ ID NO:13), EcoRI site underlined)
as primers for OsMYBS1, OsMYBS2-5' (5'-AAACTCGAGAATGCCCAACCTCACCTCCA-3'
(SEQ ID NO:14), XhoI site underlined) and OsMYBS2-3'
(5'-AGCGAATTCTTATATAAAACTGGTGAA-3' (SEQ ID NO:15), EcoRI site underlined) as
primers for OsMYBS2, and OsMYBS3-5' (5'-AAACTCGAGTATGACGAGGCGGTGCTCGCA-3'
(SEQ ID NO:16), XhoI site underlined) and OsMYBS3-3'
(5'-ATCGAATTCTCATGCCTGTGCCCTTGT-3' (SEQ ID NO:17), EcoRI site underlined) as
primers for OsMYBS3. The GAD sequence was PCR-amplified using pGAD-424 (Clontech)
as the DNA template and oligonucleotides GAD-5'
(5'-CCAGAATTCTGCAAAGATGGATAAA-3' (SEQ ID NO:18), EcoRI site underlined) and
GAD-3' (5'-CCACTCGAGCTCTCTTTTTTTGGGT-3' (SEQ ID NO:19), XhoI site
underlined) as the primers. All the PCR products were digested with XhoI and EcoRI. The
OsMYBS cDNAs were individually ligated with the GAD sequence at the XhoI site and inserted
into the EcoRI site of pBluescript to generate pGAD-OsMYBS. pGAD-424 was digested with
MluI and EcoRI to remove the 3' portion of the GAD domain. The GAD 3' portion-OsMYBS
fusion genes were excised from pGAD-OsMYBS with MluI and EcoRI and inserted into the
MluI and EcoRI sites of pGAD-424 to generate pGAD-OsMYBS-424, which contains the ADH1
promoter-GAD-OsMYBS fusion gene. These plasmids were individually introduced into the
yeast reporter strain, which hosts the 8x(TATCCA+F)-Cyc1 minimal promoter-LacZ fusion gene
or the 6xTATCCA-Cyc1 minimal promoter-LacZ fusion gene, according to the
MATCHMAKER One-Hybrid System protocol (Clontech). β-Galactosidase activity in yeast
was assayed.
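
Because the underlining that marked the restriction sites does not survive in plain text, the sites can be located programmatically. The following is a minimal sketch under stated assumptions: the primer sequences are those of SEQ ID NOs:12-19 as given above with extraction spacing removed, the recognition sequences used are the standard ones for XhoI (CTCGAG) and EcoRI (GAATTC), and the names `PRIMERS` and `site_position` are illustrative only.

```python
# Illustrative lookup of the engineered restriction sites in the PCR primers.
# Primer sequences are taken from the text above; all names here are hypothetical.

XHOI = "CTCGAG"   # XhoI recognition sequence
ECORI = "GAATTC"  # EcoRI recognition sequence

PRIMERS = {
    "OsMYBS1-5'": ("AAACTCGAGAATGACCTCCCAGGCGGCGA", XHOI),
    "OsMYBS1-3'": ("ATCGAATTCTCATTGGTGCATCTTGGCCGGA", ECORI),
    "OsMYBS2-5'": ("AAACTCGAGAATGCCCAACCTCACCTCCA", XHOI),
    "OsMYBS2-3'": ("AGCGAATTCTTATATAAAACTGGTGAA", ECORI),
    "OsMYBS3-5'": ("AAACTCGAGTATGACGAGGCGGTGCTCGCA", XHOI),
    "OsMYBS3-3'": ("ATCGAATTCTCATGCCTGTGCCCTTGT", ECORI),
    "GAD-5'": ("CCAGAATTCTGCAAAGATGGATAAA", ECORI),
    "GAD-3'": ("CCACTCGAGCTCTCTTTTTTTGGGT", XHOI),
}


def site_position(primer: str, site: str) -> int:
    """Return the 0-based index of the site in the primer, or -1 if absent."""
    return primer.upper().find(site)


# Map each primer name to the position of its expected site.
site_report = {
    name: site_position(seq, site) for name, (seq, site) in PRIMERS.items()
}

for name, pos in site_report.items():
    print(f"{name}: site at index {pos}")
```

In this sketch every primer carries its stated site three bases from the 5' end, which is consistent with a short 5' clamp preceding the engineered site.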

11. Yeast One-hybrid System for OsMYBS Transactivation Assay 

For construction of the GBD-OsMYBS fusion gene, the GBD sequence was PCR-amplified
using pGBT9 (Clontech) as the DNA template and oligonucleotides GBD-5'
(5'-CCAGAATTCAGATGAAGCTACTGTCT-3' (SEQ ID NO:20), EcoRI site underlined) and
GBD-3' (5'-CCACTCGAGTTCGATACAGTCAACTGT-3' (SEQ ID NO:21), XhoI site
underlined) as primers. Because there is one XhoI cutting site within the GBD sequence, the
PCR product was digested with EcoRI first and then partially digested with XhoI. The resulting
DNA fragment was ligated with OsMYBS cDNA and inserted into the EcoRI site of pBluescript
to generate pGBD-OsMYBS. pGAD-424 was digested with HindIII to remove the GAD
sequence and end-blunted. The GBD-OsMYBS fusion genes were excised from
pGBD-OsMYBS with EcoRI and HindIII, end-blunted, and ligated with the truncated pGAD-424 to
generate pGBD-OsMYBS-424, which contains the ADH1 promoter-GBD-OsMYBS fusion gene.
These plasmids were introduced into the yeast strain CG-1945 that hosts the 3xUAS-Cyc1
minimal promoter-LacZ and GAL1 promoter-HIS3 reporter constructs (Clontech). 3xUAS is
three tandem repeats of a synthetic UASG 17-mer consensus sequence that can be recognized by
GBD. β-Galactosidase activity in yeast or growth of yeast cells on selective medium was
assayed.

12. β-Galactosidase Activity and Yeast Cell Growth Assays

β-Galactosidase activity in transformed yeast cells was quantified by using ONPG (o-
nitrophenyl β-galactopyranoside) as the substrate. Transformed cells were selected in a medium
containing 5 mM 3-aminotriazole but no histidine according to the Yeast Protocols Handbook
(Clontech).

13. Barley Aleurone Tissue Transient Expression Assay

Particle bombardment of the barley (Hordeum vulgare) aleurone tissue and transient 
assay of GUS were performed essentially as described by Lanahan, et al. (1992) Plant Cell 4,
203-211 and Cercos, et al. (1999) Plant J. 19, 107-118. Bombarded barley embryoless half-seeds
were incubated in a buffer (20 mM each of CaCl2 and sodium succinate, pH 5.0) containing or
lacking glucose or GA3 for 20 h, and luciferase or GUS activity was determined. All the
bombardments were repeated at least four times. 

14. Rice Embryo Transient Expression Assay

Rice seeds (Oryza sativa cv. Tainung 67) were placed in Petri dishes containing two 
pieces of sterile Whatman No. 1 filter paper and a liquid MS salt mixture containing 2 mg/L 2,4-D
(Toyofuku et al. (1998) FEBS Lett 428: 275-280). After incubation for 8 days, embryos with
callus growth on their scutella were dissected from the endosperm. These embryos were then
placed, with the scutellar side up, in plates containing solid MS medium. Each plate contained
24 embryos. Particle bombardment of the callus-containing embryos was performed as
described in Umemura et al. (1998) Planta 204: 420-428. The embryos were co-bombarded with
reporter, effector and internal control plasmids at a ratio of 1:1:0.25. pGL AC was used as the
internal control. It contains the rice Act1 promoter fused upstream of the GUS cDNA (McElroy
et al. (1990), Plant Mol Biol 15: 257-268). An unrelated yeast plasmid pRS426 was used in the
co-bombardment as a negative control effector. 

After bombardment, half of the embryos were incubated in MS medium containing 100 
mM glucose or 100 mM mannitol for 18 hours. Total proteins were extracted from the embryos 
with a CCLR buffer [100 mM KH2PO4, pH 7.8, 1 mM EDTA, 10% glycerol, 1% Triton X-
100, 7 mM β-mercaptoethanol]. The GUS and luciferase activity assays were performed using
standard methods. Each experiment was repeated three or four times. 
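
The text states that a GUS internal control plasmid was co-bombarded with the luciferase reporter but does not spell out how the two readings were combined. A common approach, shown here only as a hedged sketch and not as the method actually used, is to divide the luciferase reading by the GUS reading from the same extract and then compare group means. The function names and all numeric values below are hypothetical placeholders, not data from this specification.

```python
# Illustrative normalization of a reporter reading to an internal control.
# This is an assumed analysis scheme; values are made-up placeholders.
from statistics import mean


def normalized_activity(luc: float, gus: float) -> float:
    """Luciferase reading divided by the GUS reading of the same extract."""
    if gus <= 0:
        raise ValueError("internal control reading must be positive")
    return luc / gus


def fold_change(treated: list[float], control: list[float]) -> float:
    """Ratio of mean normalized activity in treated vs. control replicates."""
    return mean(treated) / mean(control)


# Hypothetical replicate readings: (luciferase, GUS) pairs per bombardment.
with_effector = [normalized_activity(l, g) for l, g in [(400.0, 100.0), (600.0, 100.0)]]
negative_control = [normalized_activity(l, g) for l, g in [(90.0, 100.0), (110.0, 100.0)]]

effect = fold_change(with_effector, negative_control)
print(f"fold change over negative control: {effect:.1f}")
```

Dividing by the internal control corrects for differences in bombardment efficiency between samples before any fold change is calculated.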

Results 

1. Three OsMYBS Genes Encode MYB Proteins with One DNA-binding Repeat 

To obtain cDNAs encoding proteins that bind specifically to the TATCCA element, the 
South-western method was used to screen a cDNA library constructed from poly(A)+ mRNA
prepared from 4-h sucrose-starved rice suspension cells. A 416-bp probe consisting of 8 tandem
repeats of the -133 to -82 DNA fragment of the αAmy3 promoter (Lu, et al. (1998) J. Biol. Chem.
273, 10120-10131), which contains the TATCCA element, was used. cDNA clones that
interacted specifically with the probe were isolated and their sequences were analyzed. Three
different genes were identified and designated OsMYBS1, OsMYBS2, and OsMYBS3. The
open reading frames of OsMYBS1, OsMYBS2, and OsMYBS3 encode polypeptides of 306,
276, and 318 amino acid residues, respectively. The three OsMYBS proteins contain a highly
conserved single DNA-binding domain that is also highly similar to the DNA-binding domains 
of other mammalian, Drosophila and plant MYB proteins. 

Compared with other MYB proteins that contain a 1R in their DNA-binding domains, the
1R residues of the three OsMYBS proteins are most closely related to MybSt1 (StMYB1), and
less related to AtMYBL2, PcMYB1, and CCA1 (AtMYBCCA1). StMYB1 transactivates the
cauliflower mosaic virus 35S RNA gene (CaMV35S) promoter and is expressed in various
organs of potato (Baranowskij, et al. (1994) EMBO J 13, 5383-5392). AtMYBL2 is expressed in
Arabidopsis leaf but its function has not been characterized (Kirik and Baumlein (1996) Gene
183, 109-113). PcMYB1 interacts in vivo with a light-regulatory promoter unit and is expressed
in Petroselinum crispum cell cultures and seedlings (Feldbrügge, et al. (1997) Plant J. 11, 1079-
1093). AtMYBCCA1 binds to the promoter of an Arabidopsis light-harvesting chlorophyll a/b
binding protein gene, Lhcb, and mediates phytochrome regulation of Lhcb (Wang, et al. (1997)
Plant Cell 9, 491-507). The 1R sequence of OsMYBS3 is most homologous to StMYB1 (92%
identity) and somewhat homologous to OsMYBS1 (87% identity) and OsMYBS2 (85%
identity), and the 1R sequences of OsMYBS1 and OsMYBS2 are least homologous (77%
identity) to each other. There is very low homology among the N- and C-terminal regions
outside the 1R regions of all the MYB proteins with 1R DNA-binding domains, except that
OsMYBS3 and StMYB1 have 71% identity at the N-terminal 90-amino-acid region and 62%
identity at the C-terminal 70-amino-acid region.
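
The identity percentages quoted above are pairwise comparisons of aligned residues. As a rough illustration of how such a figure is computed, the sketch below counts matching positions in two pre-aligned sequences of equal length. It assumes the alignment has already been made, it ignores gap handling, and the two short peptide strings are invented placeholders rather than OsMYBS or StMYB1 sequences.

```python
# Illustrative percent-identity calculation for two pre-aligned sequences.
# The example strings are hypothetical, not sequences from this specification.


def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned positions at which the two sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must not be empty")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Two made-up 10-residue fragments differing at one position.
identity = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(f"{identity:.0f}% identity")
```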

Comparison of amino acid residues between the 1R regions of the three OsMYBS
proteins and the R2 and R3 repeats of the animal and plant MYB proteins shows that the 1R
regions contain conserved tryptophan (W), glutamic acid (E), glycine (G), and arginine (R) at the
same positions. The tryptophan residue, which plays a critical role in stabilizing the DNA-
binding domain of animal MYB proteins (Ogata, et al. (1992) Proc. Natl. Acad. Sci. USA 89,
6428-6432), is conserved at the first and second positions, but not at the third position, among
most of the 1R regions. The putative base-contacting residues in the third helix of the animal
MYB proteins are conserved in the plant R2R3 MYB proteins but not in the plant 1R MYB
proteins.

The 3' untranslated regions of the three OsMYBS genes share very low homology (33-
35% identity) and were used as gene-specific DNA probes for rice genomic DNA gel blot
analysis. Only one band hybridized with each of the three gene-specific DNA probes. This
result indicates that there is only one copy of each of the three OsMYBS genes in the rice MYB
gene family.

2. OsMYBS Genes Are Expressed in Various Rice Tissues and Barley Aleurones 

The expression patterns of OsMYBS in rice were analyzed. Total RNA was purified
from various rice tissues and subjected to gel blot analysis using OsMYBS-specific DNAs as
probes. OsMYBS1 is expressed in tissues above ground with the highest level in leaves.
OsMYBS2 is expressed mainly in roots, leaves and senescent leaves at similar levels, and
OsMYBS3 is expressed in all tissues with the highest level in senescent leaves.

To determine whether OsMYBS gene expression is regulated by sugars, total RNA was
isolated from rice suspension cells cultured in the presence or absence of sucrose for 48 h and
subjected to gel blot analysis using OsMYBS-specific DNAs as probes. The levels of
OsMYBS1 and OsMYBS3 mRNAs are low in the presence of sucrose but increase in the
absence of sucrose, paralleling the mRNA levels of αAmy3 and another rice α-
amylase gene, αAmy8. By contrast, the levels of OsMYBS2 are higher in the presence than in
the absence of sucrose.

Transient expression assays with barley half seeds were used as a system for studying the
function of OsMYBS proteins in sugar and hormone regulation in later experiments. To
determine whether OsMYBS genes are expressed in barley aleurone cells, total RNA was
isolated from barley half seeds and subjected to gel blot analysis using OsMYBS-specific DNAs
as probes. All of these OsMYBS genes are expressed in barley aleurone cells in the absence of
glucose. The expression levels of OsMYBS1 and OsMYBS3 are suppressed in the presence of
glucose or GA, but remain unchanged in the presence of ABA. The expression of OsMYBS2 is
enhanced in the presence of glucose, and slightly suppressed by ABA treatment.

3. OsMYBS Proteins Interact Specifically with the TATCCA Element in vitro 

To determine whether the three OsMYBS proteins bind specifically to the TATCCA
element, they were expressed in E. coli. Interaction between affinity-purified recombinant
OsMYBS proteins and five DNA fragments encompassing the -171 to -82 region of the αAmy3 SRS
was analyzed by gel mobility shift assay. The five DNA fragments, designated F1 through F5,
were previously used in gel mobility shift assays for their interaction with nuclear protein extract
from rice suspension cells (Lu, et al. (1998) J. Biol. Chem. 273, 10120-10131). F2, which
contains the duplicated TATCCA element with some flanking sequence in SRS, was used as the
probe for gel mobility shift assays. The three recombinant OsMYBS proteins bound to probe F2,
resulting in the formation of low mobility complexes. The interaction between OsMYBS1 or
OsMYBS2 and F2 was competed out only by F2 and the DNA fragment containing 6 tandem
repeats of the 6-bp TATCCA element but not by 500-fold excess of other DNA fragments. The
interaction between OsMYBS3 and F2 was also competed out only by F2; F5, which contains
the TATCCA element; and the DNA fragment containing 6 tandem repeats of the 6-bp TATCCA
element. A 17-bp DNA fragment containing the TATA box sequence (TATATA), which differs
from the TATCCA element by only two nucleotides, did not compete for binding.
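
The two-nucleotide difference between the TATA box sequence and the TATCCA element can be confirmed by a position-by-position comparison. The following minimal sketch computes the Hamming distance between the two 6-bp sequences named in the text; the function name `hamming` is an illustrative choice.

```python
# Illustrative position-by-position comparison of TATCCA and TATATA.


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


element = "TATCCA"   # TATCCA element
tata_box = "TATATA"  # TATA box sequence used as a competitor

mismatches = hamming(element, tata_box)
# 1-based positions at which the two sequences differ.
positions = [i + 1 for i, (x, y) in enumerate(zip(element, tata_box)) if x != y]
print(mismatches, positions)
```

The mismatches fall at the fourth and fifth positions, so the first three bases and the final base are shared between the two sequences.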

To further verify the specificity of binding between OsMYBS proteins and the TATCCA
element, a series of site-directed mutations spanning the region -119 to -102, which contains the
TATCCA tandem repeat, was constructed. The wild type (Wt) sequence was used as the probe
for gel mobility shift assays. The three recombinant OsMYBS proteins bound to the Wt probe,
resulting in the formation of low mobility complexes. Also, the binding of each of these
OsMYBS proteins was completely competed out by unlabeled Wt sequence. The DNA
sequence with mutations throughout both repeats (M1) failed to compete, and sequences with
either of the repeats replaced (M2 and M3) competed less effectively. Sequences with three of the six
bases mutated in both of these repeats (M4, M5, and M6) competed only slightly. These results
suggest that all of these OsMYBS proteins bind specifically to two copies of the TATCCA
element. However, the binding affinity decreases if only one copy of TATCCA exists.

4. OsMYBS Proteins Interact with the TATCCA Element in vivo

The in vivo binding activity of OsMYBS proteins to the TATCCA element was analyzed
using a yeast one-hybrid system. The full-length OsMYBS sequences were individually fused at
the C-terminus of the activation domain of the yeast GAL4 transcriptional activator (GAD).
The GAD-OsMYBS chimeric gene was then fused downstream of the yeast ADH1 promoter and
used as an effector construct. A yeast strain containing a Cyc1 minimal promoter-LacZ fusion
reporter construct was then transformed with the effector construct. Neither GAD alone
nor the GAD-OsMYBS fusion protein led to expression of LacZ.

Eight tandem repeats of the -133 to -82 DNA sequence of αAmy3, designated
8x(TATCCA+F), were fused upstream of the Cyc1 minimal promoter-LacZ coding region and
served as a reporter construct. The reporter construct was delivered into yeast first, and the yeast
was then transformed with the effector constructs containing the ADH1-GAD-OsMYBS fusion
gene. The presence of GAD alone increased LacZ expression by 5-fold, indicating that the
TATCCA element with flanking sequences slightly enhanced the Cyc1 minimal promoter
activity even in the absence of OsMYBS. The presence of GAD-OsMYBS1 and GAD-
OsMYBS2 increased LacZ expression by 3- and 5-fold, respectively, compared with LacZ
expression in the absence of OsMYBS. In contrast, the presence of OsMYBS3 reduced LacZ
expression by 2-fold.

To test whether the flanking sequence of the TATCCA element affects the binding
affinity of OsMYBS1 and OsMYBS2, six tandem repeats of the 6-bp TATCCA element,
designated 6xTATCCA, were fused upstream of the Cyc1 minimal promoter-LacZ coding region
and served as a reporter construct. The reporter construct was delivered into yeast and the yeast
was then transformed with the effector construct that contains the ADH1-GAD-OsMYBS fusion
gene. Only the presence of GAD-OsMYBS2 increased LacZ expression.

5. OsMYBS1 and OsMYBS2 Are Transcriptional Activators

To determine whether OsMYBS is a transcriptional activator, the full-length OsMYBS
sequence was fused at the C-terminus of the DNA-binding domain of GAL4 (GBD). The
chimeric gene was then fused downstream of the yeast ADH1 promoter and used as an effector
construct. A yeast strain hosting the 3xUAS-Cyc1 minimal promoter-LacZ fusion reporter
construct was then transformed with the effector construct. The presence of the GBD-OsMYBS1
fusion protein increased LacZ expression 8-fold, GBD-OsMYBS2 only slightly increased LacZ
expression, and GBD-OsMYBS3 did not increase LacZ expression.

A yeast strain harboring the GAL1 promoter-HIS3 fusion reporter construct was also
transformed with the effector construct that contains the ADH1-GBD-OsMYBS fusion gene.
The presence of the GBD-OsMYBS1 fusion protein allowed yeast cells to grow well on a
medium lacking histidine in the presence of 3-AT. The presence of the GBD-OsMYBS2 fusion
protein allowed yeast cells to grow slowly, but the presence of the GBD-OsMYBS3 fusion
protein did not allow yeast cells to grow on the selective medium. The above results suggest that
OsMYBS1 is a strong transcriptional activator, OsMYBS2 is a weak transcriptional activator,
and OsMYBS3 is not a transcriptional activator in yeast.

6. OsMYBS1 and OsMYBS2 Derepress Sugar-repressed Transcription of a Promoter Containing
SRS 

To determine the function of OsMYBS in sugar regulation of α-amylase gene expression
in plant cells, the transactivation ability of OsMYBS on SRS fused to the CaMV35S minimal
promoter (-46 bp upstream of transcription start site), forming an SRS-CaMV35S chimeric
promoter, was analyzed in barley aleurone cells, which yield more consistent results than rice
protoplasts. The three OsMYBS cDNAs were individually fused downstream of the maize 
ubiquitin gene (Ubi) promoter, forming Ubi-OsMYBS chimeric genes, which served as effectors. 
The coding region of the luciferase gene (Luc) was fused downstream of the SRS-CaMV35S 
chimeric promoter and served as a reporter. The barley half seeds were particle-bombarded 
simultaneously with the effector and reporter plasmids. The bombarded half seeds were divided 
into two halves and each half was incubated with 0.3 M glucose or 0.3 M mannitol for 24 h, and 
luciferase activity was determined. Luciferase activity was higher in the absence of glucose 
when the half seeds were co-bombarded with the reporter plasmid and plasmid pRS426, an 
unrelated yeast plasmid with a molecular weight similar to that of the effector plasmid, which 
was used as a negative control. This result is consistent with our previous finding that the SRS- 
CaMV35S chimeric promoter was activated in rice protoplasts in the absence of glucose and 
repressed in the presence of glucose (Lu, et al. (1998) J. Biol. Chem. 273, 10120-10131). When 
the half seeds were co-bombarded with the reporter and effector plasmids, OsMYBS1
significantly enhanced luciferase activity in the presence of glucose to a level similar to that
in the absence of glucose. OsMYBS1 also enhanced luciferase activity in the absence of
glucose, but to a lesser extent. OsMYBS2 enhanced luciferase activity in the presence of 
glucose, but did not affect the activity in the absence of glucose. OsMYBS3 did not significantly 
enhance luciferase activity in the presence of glucose and repressed luciferase activity in the 
absence of glucose. These results indicate that OsMYBS1 and OsMYBS2 function as a strong
and weak activator, respectively, and that OsMYBS3 serves as a repressor, for transcription of 
the promoter containing SRS.

7. OsMYBS1 Derepresses Sugar-repressed Transcription of a Promoter Containing the
TATCCA Element with Adjacent Flanking Sequences

To determine whether OsMYBS transactivates a promoter containing only the TATCCA
element and adjacent flanking sequences, a DNA fragment containing 6 tandem repeats of the
-133 to -82 sequence of SRS, designated 6x(TATCCA+F), was fused to the CaMV35S minimal
promoter. A plasmid containing the Ubi-OsMYBS construct served as an effector, and Luc
cDNA fused downstream of the 6x(TATCCA+F)-CaMV35S promoter served as a reporter. The
barley half seeds were co-bombarded with the effector and reporter plasmids. The bombarded
half seeds were divided into two halves and each half was incubated with 0.3 M glucose or 0.3 M
mannitol for 24 h, and luciferase activity was determined. OsMYBS1 significantly enhanced
luciferase activity in the presence of glucose to the same level as in the absence of glucose.
Although OsMYBS2 and OsMYBS3 seemed to slightly enhance luciferase expression in the
presence of glucose, they significantly suppressed luciferase activity in the absence of glucose.
This result indicates that OsMYBS1 activates, while OsMYBS2 and OsMYBS3 repress,
transcription of a promoter containing only the TATCCA element and adjacent flanking
sequences.

8. OsMYBS Proteins Cooperate with HvMYBGa in Transactivation of a Barley Amy32b 
Promoter 

The GARC present in α-amylase gene promoters includes a GARE and the TATCCA
element (Gubler and Jacobsen (1992) Plant Cell 4, 1435-1441; Lanahan, et al. (1992) Plant Cell
4, 203-211). HvMYBGa has been shown to bind specifically to GARE in vitro, and to activate
transcription of barley high-pI α-amylase and cysteine proteinase gene promoters in the absence
of GA in vivo (Gubler, et al. (1995) Plant Cell 7, 1879-1891; Cercos, et al. (1999) Plant J. 19,
107-118). Since the TATCCA element is essential for high-level GA-activated transcription of
α-amylase gene promoters (Gubler and Jacobsen (1992) Plant Cell 4, 1435-1441; Lanahan, et al.
(1992) Plant Cell 4, 203-211), whether or not the OsMYBS proteins cooperate with HvMYBGa
in enhancing transcription of the α-amylase gene promoter was tested. The barley half seeds
were co-bombarded with an effector plasmid containing the Ubi-HvMYBGa construct, another
effector plasmid containing the Ubi-OsMYBS construct, and the reporter plasmid containing the
Amy32b-GUS construct. GUS was expressed only in the presence of GA when no HvMYBGa
was present, and overexpression of HvMYBGa significantly enhanced GUS activity in the
absence of GA. This result is consistent with other results reported in the literature (Gubler, et
al. (1995) Plant Cell 7, 1879-1891; Cercos, et al. (1999) Plant J. 19, 107-118). Interestingly, co-
expression of HvMYBGa with OsMYBS1, OsMYBS2, or OsMYBS3 further enhanced GUS
activity approximately 2- to 3-fold over that obtained with HvMYBGa alone in the absence of
GA. These results suggest that the three OsMYBS proteins cooperate with HvMYBGa in
transactivation of the barley α-amylase gene promoter.

9. Suppressor of OsMYBS Proteins 

Rice embryos were co-bombarded using a standard method with a reporter plasmid
p3Luc.18 and an effector construct encoding Ubi fusion protein Ubi-OsMYBS1 or Ubi-OsBZ8.
OsBZ8 is a bZIP protein. The bombarded rice embryos were divided into two halves, and each half
was incubated in the presence or absence of 100 mM glucose for 18 hours. Luciferase assays
were then carried out as described above to determine the effects of OsBZ8 on the transactivity of
OsMYBS1. The results indicate that OsBZ8 completely suppressed the transactivity of
OsMYBS1.

Rice embryos were also co-bombarded with a reporter construct T7 containing a
modified αAmy8 SRS/GARS promoter linked to the CaMV35S minimal promoter-Luc chimeric
gene, an effector plasmid containing the Ubi-OsMYB1, and another effector plasmid encoding
Ubi-HvMYBGa or AtABI5. AtABI5 is a G-box binding bZIP factor (Finkelstein et al. (2000)
Plant Cell 12: 599-609). Luciferase assays were carried out by a standard method. The results
indicate that HvMYBGa and AtABI5 increased the ability of OsMYB1 to transactivate the
expression of the reporter gene by 4-5 and 2-3 fold, respectively.

10. OsMYBS1 Forms a Homodimer in Barley Aleurone Cells

In c-Myb, R2 and R3 bind to specific DNA sequences cooperatively. To test the
possibility that OsMYBS binds DNA as dimers, homologous and heterologous interactions
among OsMYBS proteins were analyzed using a two-hybrid system in barley aleurone cells.
The full-length OsMYBS sequences were individually fused at the C-terminus of GAD or GBD.
The GAD-OsMYBS and GBD-OsMYBS chimeric genes were fused downstream of the Ubi
promoter and used as effector constructs. A DNA fragment containing five tandem repeats of
UAS (5xUAS) was fused upstream of the CaMV35S minimal promoter-Luc chimeric gene and
used as a reporter construct. The barley half seeds were particle-bombarded simultaneously with
two effector plasmids, each containing GAD-OsMYBS and GBD-OsMYBS fusion genes, and
the reporter plasmid. The bombarded half seeds were incubated in buffer without glucose for 24
h, and luciferase activity was determined. Coexpression of GBD-OsMYBS1 and GAD-
OsMYBS1 significantly enhanced luciferase expression compared with the expression of GBD-
OsMYBS1 alone. However, coexpression of GBD-OsMYBS1 and GAD-OsMYBS2 or GAD-
OsMYBS3 did not further enhance luciferase activity compared with the expression of GBD-
OsMYBS1 alone. Similar experiments with OsMYBS2 and OsMYBS3 showed that the two
proteins did not interact homologously or heterologously. To confirm that only OsMYBS1
interacts with itself and that the three OsMYBS proteins do not interact with
one another, the effector and reporter plasmids were reconstructed for expression in yeast. Except
that the background level of reporter gene expression in the presence of GBD-OsMYBS1 alone
was too high to detect the interactions between GBD-OsMYBS1 and GAD-OsMYBS proteins,
similar results were obtained with the yeast two-hybrid experiments. These results suggest that
only OsMYBS1 forms a homodimer under these experimental conditions.

OTHER EMBODIMENTS 

All of the features disclosed in this specification may be combined in any combination. 
Each feature disclosed in this specification may be replaced by an alternative feature serving the 
same, equivalent, or similar purpose. Thus, unless expressly stated otherwise, each feature 
disclosed is only an example of a generic series of equivalent or similar features. 

From the above description, one skilled in the art can easily ascertain the essential 
characteristics of the present invention, and without departing from the spirit and scope thereof, 
can make various changes and modifications of the invention to adapt it to various usages and 
conditions. Thus, other embodiments are also within the scope of the following claims.